Abstract

This paper introduces a new parsimonious structure for mixtures
of autoregressive models. The weighting coefficients are determined
through latent random variables that follow a hidden Markov model.
We propose a dynamic programming algorithm for forecasting.
We also derive the limiting behavior of the unconditional
first moment of the process and an appropriate upper bound for the
limiting value of the variance, which together describe the long-run
behavior of the process. Finally, we show convergence and stability of
the second moment, and we illustrate the efficacy of the
proposed model by simulation and forecasting.
1 Introduction

The most frequently used approaches to time series model building assume 
that the data under study are generated from a linear stochastic process. 
Linear models provide a number of appealing properties (such as physical 
interpretations, frequency domain analysis, asymptotic results, statistical in- 
ference and many others) 0. Despite those advantages, it is well known that 
real-life systems are usually nonlinear, and certain features, such as limit
cycles, asymmetry [13], [17], conditional heteroscedasticity [9], flat stretches,
bursts [14] and jump phenomena cannot be correctly captured by linear
statistical models.
Since the Mixture Transition Distribution (MTD) model was originally
introduced by Raftery [19] for modeling high-order Markov chains on a
discrete state space, this broad family of models has been extended and
applied to modeling the conditional distribution of observations in the
context of nonlinear time series with arbitrary state spaces [3]. The model
has also been extended to a mixture transition of Gaussian distributions,
known as GMTD, which contains the autoregressive model as a special case, for
modeling flat stretches, bursts and outliers [14]. The Mixture of Autoregressive
(MAR) model, proposed by Wong and Li [21], is a flexible
generalization of GMTD for modeling processes with multimodal conditional
distributions and conditional heteroscedasticity. An important feature of
the MAR model is that it can be a mixture of stationary
and non-stationary AR processes and still remain stationary. For a time series
$\{Y_t\}$, $Y_t \in \mathbb{R}$, the MAR$(K; p_1, p_2, \ldots, p_K)$ model is defined as

$$F(y_t \mid \mathcal{F}_{t-1}) = \sum_{k=1}^{K} \alpha_k \, \Phi\!\left(\frac{y_t - a_{0,k} - a_{1,k} y_{t-1} - \cdots - a_{p_k,k} y_{t-p_k}}{\sigma_k}\right), \qquad (1)$$

in which $y_t$ denotes a realization of $Y_t$, $\mathcal{F}_t = \sigma\{Y_s : s \le t\}$, and
$F(y_t \mid \mathcal{F}_{t-1})$ is the conditional distribution of $Y_t$ given the information in $\mathcal{F}_{t-1}$.
Here $a_{i,k}$ and $\sigma_k$ denote the autoregressive coefficients and the standard
deviation of the $k$-th component. Also, $\alpha_k$, $k = 1, \ldots, K$, are the weighting
coefficients (i.e. $\alpha_k > 0$, $k = 1, \ldots, K$, and $\sum_{k=1}^{K} \alpha_k = 1$) and $\Phi(\cdot)$ is the
cumulative distribution function of the standard normal distribution. This model
is a mixture of $K$ Gaussian AR$(p_k)$, $k = 1, \ldots, K$, models [21].

The mixture of autoregressive conditional heteroscedasticity model was
also proposed by Wong and Li [22] to capture the squared autocorrelation
structure of observations. Berchtold [2] also introduced a new approach for
modeling heteroscedastic time series with the MTD model, in which the variance
of each Gaussian distribution depends on the past observations of the time series.
For an exhaustive review of the MTD model see [3].

In the MTD models the contributions of the distributions are always fixed
and are not sensitive to the past observations. However, for real processes
one might expect better forecast intervals if additional information from the
past were allowed to affect the forecast variance [9]. Another approach to studying
mixture models is to introduce latent variables $\{Z_t\}_{t \ge p}$, which are iid, such
that $Y_t$ given $Z_t$ is independent of $\{Z_s : s \ne t\}$. Each variable $Z_t$ has a discrete
distribution with support $\{1, \ldots, K\}$ and probability masses $P(Z_t = k) = \alpha_k$,
$k = 1, \ldots, K$, which serve as the weighting coefficients in the mixture model. Since
these models do not consider the dependency structure of the latent variables, the
dynamics of the weighting coefficients cannot be modeled. For finite state
space time series, Bartolucci and Farcomeni [1] studied a generalization of
mixture transition models with hidden Markov models.

In this paper, we propose a new approach to modeling the conditional
distribution of $Y_t$ given past information for nonlinear time series in a general state
space (i.e. $Y_t \in \mathbb{R}$). We use a latent Markov process as a tool to
account for the effect of past information and to build a parsimonious model,
following the idea of Markov switching models (see Hamilton [12], McCulloch and Tsay
[18]) for the process $\{Y_t\}_{t \ge 0}$. Our new model includes the hidden Markov model
(HMM) [6] as a special case and also generalizes the MAR model in a sensible
way. The model makes use of the whole past information to maximize
the posterior probability of $Z_{t-1}$ (given observed $Y_0, \ldots, Y_{t-1}$) and predicts
the probability of $Z_t$ by the Markov assumption of the latent process.
Although using all past observations could increase the complexity of
the model, we propose a dynamic programming algorithm which reduces the
volume of calculations for forecasting. We derive the limiting behavior of
the first unconditional moment of the process, and obtain an upper bound
for the limit of the variance. We also investigate the existence and stability of
the second moment.

This paper is organized as follows. The Hidden Markov Mixture
Autoregressive (HM-MAR) model is introduced in Section 2.
Section 3 is devoted to the statistical properties of the HM-MAR
model. Section 4 analyzes the efficiency of the proposed model
through simulation and a comparison of its forecast errors with those of the
MAR model. Section 5 concludes the paper.

2 Hidden Markov Mixture Autoregressive Model 

Let $Y = \{Y_t\}_{t \ge 0}$ be a sequence of random variables in $\mathbb{R}$, where $y_t$ is a
realization of $Y_t$. Also let $\mathcal{F}_t = \sigma\{Y_s : s \le t\}$ and $F(y_t \mid \mathcal{F}_{t-1})$
respectively represent the sigma-field of all information up to time $t$ and the
conditional distribution function of $Y_t$ given past information, and let
$\alpha_h^{(t)} = \alpha_h^{(t)}(y_0, \ldots, y_{t-1})$. In addition, $\{Z_t\}_{t \ge p}$ denotes a hidden or latent process,
a positive recurrent Markov chain on a finite set $E = \{1, 2, \ldots, K\}$. The
initial conditional probabilities are

$$\rho = (\rho_1, \ldots, \rho_K)', \quad \rho_h = P(Z_p = h \mid y_0, \ldots, y_{p-1}), \quad h = 1, \ldots, K, \qquad (2)$$

with transition probability matrix

$$P = \|\pi_{ij}\|_{K \times K}, \qquad (3)$$

in which

$$\pi_{ij} = P(Z_t = j \mid Z_{t-1} = i), \quad i, j \in \{1, \ldots, K\}, \qquad (4)$$

and invariant probability measure

$$\mu = (\alpha_1, \ldots, \alpha_K)', \qquad (5)$$

where $\alpha_j = \lim_{t \to \infty} P(Z_t = j)$.

We say that $\{Y_t\}_{t \ge 0}$ follows a Hidden Markov-Mixture Autoregressive model,
HM-MAR$(K, p)$, with $K$ normal distributions and $p$ lagged observations
in the AR processes, if the conditional distribution of $Y_t$ given $\mathcal{F}_{t-1}$ is defined
as follows:

i. For $t = p$,

$$F(y_p, Z_p = h \mid \mathcal{F}_{p-1}) = \rho_h \, \Phi\!\left(\frac{y_p - a_{0,h} - a_{1,h} y_{p-1} - \cdots - a_{p,h} y_0}{\sigma_h}\right), \qquad (6)$$

ii. For $t \ge p + 1$,

$$F(y_t \mid \mathcal{F}_{t-1}) = \sum_{h=1}^{K} \alpha_h^{(t)} \, \Phi\!\left(\frac{y_t - a_{0,h} - a_{1,h} y_{t-1} - \cdots - a_{p,h} y_{t-p}}{\sigma_h}\right), \qquad (7)$$

where $\alpha_h^{(t)} = P(Z_t = h \mid \mathcal{F}_{t-1})$ and $\Phi(\cdot)$ is the standard normal distribution
function.

The latent random variables $\{Z_t\}_{t \ge p}$ determine the contribution of each
distribution in the mixture model. We assume that, conditionally on $Z_t$,
$Y_t$ is $p$-tuple Markov and independent of $\{Z_s, s \ne t\}$. In other words,
conditionally on $\{Y_{t-1}, \ldots, Y_{t-p}\}$ and $Z_t$, $Y_t$ is independent of
$\{Y_s, s < t - p\}$ and $\{Z_s, s \ne t\}$.

The novelty of the HM-MAR model is that the contribution of each
distribution in the mixture structure is not of a predefined fixed form: it makes
use of all past observations from $Y_0$ up to $Y_{t-1}$. The hidden Markov
assumption on the process $\{Z_t\}_{t \ge p}$ enables us to build a parsimonious model.

The MAR model [21] can be considered as a special case of the HM-MAR
model (6)-(7) in which the transition matrix $P$ of the process $\{Z_t\}_{t \ge p}$
has $K$ identical rows, i.e. $P(Z_t = i \mid Z_{t-1} = j) = \alpha_i$ for all $i, j = 1, \ldots, K$,
so that the variables $\{Z_t\}$ are independent and identically distributed.

The HM-MAR model also reduces to a hidden Markov model in a general state
space when $p$ is taken to be zero in (7) (i.e. $Y_t$, given $Z_t$, is independent
of past observations).
3 Statistical Properties of the Model

In this section, we discuss the statistical properties of the HM-MAR model.
We propose a dynamic programming approach to calculate the conditional
expectation and variance of the process. We also investigate the long-run
behavior of the first-order HM-MAR$(K, 1)$ process, including the limiting
behavior of the unconditional first moment and an appropriate upper bound
for the limiting value of the variance. Finally, convergence and stability of
the second moment are proved.

3.1 Forecasting 

In the HM-MAR model (6)-(7), the conditional expectation, as the least squares
predictor (page 64 of [7]) of the process $Y_t$ for $t \ge p + 1$, is obtained by

$$E(Y_t \mid \mathcal{F}_{t-1}) = \sum_{h=1}^{K} \alpha_h^{(t)} \left(a_{0,h} + a_{1,h} y_{t-1} + \cdots + a_{p,h} y_{t-p}\right), \qquad (8)$$

where $\alpha_h^{(t)}$ is $\mathcal{F}_{t-1}$-measurable.

One of the main families of models for conditional heteroscedasticity (changes
in the conditional variance) is the family of ARCH models [11], originally
proposed by Engle [9] in the context of financial time series. In the class
of MTD models, the MAR [21] and MAR-ARCH [22] models also provide a
mechanism to capture this effect. However, in these models only changes
in the conditional mean of each distribution affect the conditional variance of
the process. The conditional variance of the HM-MAR model is given by

$$Var(Y_t \mid \mathcal{F}_{t-1}) = \sum_{h=1}^{K} \alpha_h^{(t)} \left(\sigma_h^2 + (a_{0,h} + a_{1,h} y_{t-1} + \cdots + a_{p,h} y_{t-p})^2\right) - \Big\{\sum_{h=1}^{K} \alpha_h^{(t)} (a_{0,h} + a_{1,h} y_{t-1} + \cdots + a_{p,h} y_{t-p})\Big\}^2$$

$$= \sum_{h=1}^{K} \alpha_h^{(t)} \sigma_h^2 + \sum_{h=1}^{K} \alpha_h^{(t)} \mu_{h,t}^2 - \Big(\sum_{h=1}^{K} \alpha_h^{(t)} \mu_{h,t}\Big)^2, \qquad (9)$$

in which $\mu_{h,t} = a_{0,h} + a_{1,h} y_{t-1} + \cdots + a_{p,h} y_{t-p}$ is the conditional mean of
the $h$-th distribution (i.e. $E[Y_t \mid Z_t = h, \mathcal{F}_{t-1}]$). Let $\mu_t$ be a random variable
which takes the values $\mu_{h,t}$ with probabilities $\alpha_h^{(t)}$ for $h = 1, \ldots, K$;
then $\sum_{h=1}^{K} \alpha_h^{(t)} \mu_{h,t}^2 - (\sum_{h=1}^{K} \alpha_h^{(t)} \mu_{h,t})^2$ can be interpreted as the
conditional variance of $\mu_t$ given all past observations. This quantity is
small (large) when all conditional means are equal (largely different).
Relation (9) shows the impact of the conditional means $\mu_{h,t}$ and the
weighting coefficients $\alpha_h^{(t)}$ on the value of the conditional variance of
$Y_t$ given all past information. This is the merit of the HM-MAR model:
it can model conditional heteroscedasticity as a function of
simultaneous changes in the weighting coefficients as well as in the conditional
mean of each distribution.
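
As an illustration of relations (8) and (9), the following minimal sketch computes the one-step conditional mean and variance from a given vector of weights. The function name, argument names and the numerical values are hypothetical and chosen only for this example; the weights are taken as given here rather than computed from data.

```python
from typing import Sequence, Tuple


def mixture_mean_and_variance(
    weights: Sequence[float],        # alpha_h^(t), h = 1..K (assumed to sum to 1)
    a0: Sequence[float],             # intercepts a_{0,h}
    ar: Sequence[Sequence[float]],   # ar[h][i-1] = a_{i,h}, i = 1..p
    sigma: Sequence[float],          # component standard deviations sigma_h
    past: Sequence[float],           # past[i-1] = y_{t-i}, i = 1..p
) -> Tuple[float, float]:
    """Return (E[Y_t | past], Var[Y_t | past]) following (8) and (9)."""
    # Component means mu_{h,t} = a_{0,h} + sum_i a_{i,h} * y_{t-i}.
    means = [
        a0[h] + sum(c * y for c, y in zip(ar[h], past))
        for h in range(len(weights))
    ]
    mean = sum(w * m for w, m in zip(weights, means))
    # First sum in (9): weighted average of the component variances.
    within = sum(w * s * s for w, s in zip(weights, sigma))
    # Remaining terms in (9): variance of mu_t across components.
    between = sum(w * m * m for w, m in zip(weights, means)) - mean * mean
    return mean, within + between


# Two components with opposite AR(1) slopes and equal weights (illustrative values).
mean, var = mixture_mean_and_variance(
    weights=[0.5, 0.5], a0=[0.0, 0.0], ar=[[1.0], [-1.0]],
    sigma=[1.0, 1.0], past=[2.0],
)
```

In this example the component means are far apart, so the between-component term dominates the conditional variance even though both component variances equal one.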

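At each time step $t$, $\alpha_h^{(t)}$ (in equations (8) and (9)) can be determined
via a dynamic programming method based on the forward recursion algorithm,
given in Remark 3.1.

Remark 3.1. Let $y_r^s = (y_r, \ldots, y_s)$ for $s > r$. The weighting functions
$\alpha_h^{(t)}$ in the HM-MAR model satisfy

$$\alpha_h^{(t)} = \frac{\sum_{m=1}^{K} F(y_p^{t-1}, Z_{t-1} = m \mid y_0^{p-1}) \, \pi_{m,h}}{\sum_{m=1}^{K} F(y_p^{t-1}, Z_{t-1} = m \mid y_0^{p-1})}, \qquad (10)$$

where $F(y_p^t, z_t \mid y_0^{p-1})$ is calculated recursively as

$$F(y_p^t, Z_t = h \mid y_0^{p-1}) = \sum_{m=1}^{K} F(y_p^{t-1}, Z_{t-1} = m \mid y_0^{p-1}) \, \pi_{m,h} \, \Phi\!\left(\frac{y_t - a_{0,h} - a_{1,h} y_{t-1} - \cdots - a_{p,h} y_{t-p}}{\sigma_h}\right), \qquad (11)$$

and the recursion starts for $t = p$ by

$$F(y_p, Z_p = h \mid y_0^{p-1}) = \rho_h \, \Phi\!\left(\frac{y_p - a_{0,h} - \sum_{i=1}^{p} a_{i,h} y_{p-i}}{\sigma_h}\right).$$
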
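Proof. As the hidden variables $\{Z_t\}_{t \ge p}$ have a Markov structure in the HM-MAR
model, we have

$$P(Z_t = h \mid y_0^{t-1}) = \sum_{m=1}^{K} P(Z_t = h, Z_{t-1} = m \mid y_0^{t-1})$$

$$= \sum_{m=1}^{K} P(Z_t = h \mid Z_{t-1} = m, y_0^{t-1}) \, P(Z_{t-1} = m \mid y_0^{t-1})$$

$$= \sum_{m=1}^{K} P(Z_t = h \mid Z_{t-1} = m) \, P(Z_{t-1} = m \mid y_0^{t-1}),$$

and $P(Z_{t-1} = m \mid y_0^{t-1})$ is obtained by normalizing $F(y_p^{t-1}, Z_{t-1} = m \mid y_0^{p-1})$ over $m$, where

$$F(y_p^{t-1}, Z_{t-1} = m \mid y_0^{p-1}) = \sum_{j=1}^{K} F(y_p^{t-1}, Z_{t-1} = m, Z_{t-2} = j \mid y_0^{p-1})$$

$$= \sum_{j=1}^{K} F(y_{t-1} \mid Z_{t-1} = m, Z_{t-2} = j, y_0^{t-2}) \, P(Z_{t-1} = m \mid Z_{t-2} = j, y_0^{t-2}) \, F(y_p^{t-2}, Z_{t-2} = j \mid y_0^{p-1})$$

$$= \sum_{j=1}^{K} \Phi\!\left(\frac{y_{t-1} - a_{0,m} - a_{1,m} y_{t-2} - \cdots - a_{p,m} y_{t-1-p}}{\sigma_m}\right) \pi_{j,m} \, F(y_p^{t-2}, Z_{t-2} = j \mid y_0^{p-1}),$$

in which the last equality follows from (7) and the recursion begins with (6).

□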

Another characteristic of the HM-MAR model is that it models all past observations
while benefiting from a dynamic programming approach, which in turn
reduces the volume of calculations for forecasting. The intermediate results,
and in fact the last state $F(y_p^t, Z_t = h \mid y_0^{p-1})$, are stored for the
different values of $Z_t$ and can be used to update the process; see
(10)-(11).
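
The forward recursion of Remark 3.1 can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function and argument names are hypothetical, the normal density is used for the component likelihood (the text above writes the recursion with $\Phi$), and the stored joint quantities are rescaled at every step to avoid numerical underflow, which leaves the ratio in (10) unchanged.

```python
import math
from typing import List, Sequence


def normal_pdf(x: float) -> float:
    """Standard normal density."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def forward_weights(
    y: Sequence[float],              # observations y_0, ..., y_{T-1}
    rho: Sequence[float],            # initial probabilities rho_h, as in (2)
    P: Sequence[Sequence[float]],    # P[m][h] = pi_{m,h}, as in (4)
    a0: Sequence[float],             # intercepts a_{0,h}
    ar: Sequence[Sequence[float]],   # ar[h][i-1] = a_{i,h}, i = 1..p
    sigma: Sequence[float],          # component standard deviations
) -> List[List[float]]:
    """Return the weights alpha^(t) for t = p+1, ..., T (one list per t)."""
    K, p = len(rho), len(ar[0])

    def lik(h: int, t: int) -> float:
        # Density of y_t under component h given y_{t-1}, ..., y_{t-p}.
        mean = a0[h] + sum(ar[h][i] * y[t - 1 - i] for i in range(p))
        return normal_pdf((y[t] - mean) / sigma[h]) / sigma[h]

    # Start of the recursion at t = p, cf. (6).
    joint = [rho[h] * lik(h, p) for h in range(K)]
    weights = []
    for t in range(p + 1, len(y) + 1):
        total = sum(joint)
        filtered = [j / total for j in joint]  # P(Z_{t-1} = m | y_0..y_{t-1})
        # Prediction step, cf. (10).
        alpha = [sum(filtered[m] * P[m][h] for m in range(K)) for h in range(K)]
        weights.append(alpha)
        if t < len(y):
            # Update step, cf. (11), up to the rescaling by `total`.
            joint = [alpha[h] * lik(h, t) for h in range(K)]
    return weights


# With identical rows in P (the MAR special case) the weights do not depend on the data.
w_mar = forward_weights(
    y=[0.1, -0.4, 0.8, 0.2], rho=[0.5, 0.5], P=[[0.3, 0.7], [0.3, 0.7]],
    a0=[0.0, 1.0], ar=[[0.5], [-0.2]], sigma=[1.0, 2.0],
)
```

Only the current vector `joint` is stored between time steps, so each update costs on the order of $K^2$ operations regardless of how many past observations have been processed.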



3.2 Stability 

In this section, we investigate the stability of moments for the nonlinear
process $\{Y_t\}_{t \ge 0}$ that admits an HM-MAR$(K, 1)$ model. This process is
represented as a random coefficient autoregressive process of order one, in which
the autoregressive coefficients are functions of the latent random variables
$\{Z_t\}_{t \ge 1}$ (see Equations (2)-(5)). Let the random variables $a_{i,Z_t}$ and
$\sigma_{Z_t}$ respectively take values in $\{a_{i,1}, \ldots, a_{i,K}\}$ for $i = 0, 1$, and in
$\{\sigma_1, \ldots, \sigma_K\}$, where $a_{i,j}$ and $\sigma_j$, $j = 1, \ldots, K$, are used in the
HM-MAR model (6)-(7) with $p = 1$. We consider

$$Y_t = a_{0,Z_t} + a_{1,Z_t} Y_{t-1} + \sigma_{Z_t} \varepsilon_t, \qquad (12)$$

where $\{\varepsilon_t\}_{t \ge 1}$ is a Gaussian IID(0,1) process, independent of the hidden
process $\{Z_t\}_{t \ge 1}$. The conditional distribution of the process $Y_t$ in Equation
(12) is determined as

$$F(y_t \mid \mathcal{F}_{t-1}) = \sum_{h=1}^{K} P(Z_t = h \mid \mathcal{F}_{t-1}) \, F(y_t \mid Z_t = h, \mathcal{F}_{t-1}),$$

in which $P(Z_t = h \mid \mathcal{F}_{t-1}) = \alpha_h^{(t)}$ is given by Remark 3.1. By the Gaussian
distribution of $\varepsilon_t$ in (12), we have

$$F(y_t \mid Z_t = h, \mathcal{F}_{t-1}) = \Phi\!\left(\frac{y_t - a_{0,h} - a_{1,h} y_{t-1}}{\sigma_h}\right).$$

Thus $\{Y_t\}_{t \ge 0}$ admits an HM-MAR$(K, 1)$ model in the sense of (7).

Notice that the process $\{Y_t\}_{t \ge 0}$ is not necessarily a Markov process;
however, the extended process $X = \{X_t\}_{t \ge 1}$ with $X_t = (Z_t, \mathbf{Y}_t')'$,
$\mathbf{Y}_t = (Y_t, \ldots, Y_{t-p})'$, is Markov [23].
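
To make representation (12) concrete, the following sketch draws a sample path of the regime sequence and of the observations. It is a hypothetical helper written for illustration only: the function name, the seed handling and the convention of drawing the first regime from `rho` are assumptions of this example.

```python
import random
from typing import List, Sequence, Tuple


def simulate_hm_mar1(
    n: int,                          # number of steps to simulate
    rho: Sequence[float],            # distribution of the first regime
    P: Sequence[Sequence[float]],    # P[i][j] = P(Z_t = j | Z_{t-1} = i)
    a0: Sequence[float],             # intercepts a_{0,h}
    a1: Sequence[float],             # AR(1) coefficients a_{1,h}
    sigma: Sequence[float],          # standard deviations sigma_h
    y0: float = 0.0,                 # starting value Y_0
    seed: int = 0,
) -> Tuple[List[int], List[float]]:
    """Simulate Y_t = a_{0,Z_t} + a_{1,Z_t} Y_{t-1} + sigma_{Z_t} eps_t."""
    rng = random.Random(seed)
    states = list(range(len(rho)))
    z = rng.choices(states, weights=rho)[0]
    zs: List[int] = []
    ys: List[float] = [y0]
    for _ in range(n):
        zs.append(z)
        eps = rng.gauss(0.0, 1.0)  # Gaussian IID(0,1) innovation
        ys.append(a0[z] + a1[z] * ys[-1] + sigma[z] * eps)
        z = rng.choices(states, weights=P[z])[0]  # Markov transition
    return zs, ys


# Illustrative parameters: the second regime is explosive on its own (a1 > 1).
zs, ys = simulate_hm_mar1(
    n=200, rho=[0.5, 0.5], P=[[0.9, 0.1], [0.8, 0.2]],
    a0=[1.0, 2.0], a1=[0.3, 1.2], sigma=[1.0, 1.0],
)
```

The returned list `ys` has length `n + 1` because it includes the starting value, while `zs` holds the `n` regimes that generated the subsequent observations.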

Timmermann [20] derived the moments of a class of stationary Markov
switching models with state-dependent autoregressive dynamics and
conditional mean. Our approach to deriving the limiting behavior of the
first and second moments of the process $Y_t$ is not based on a stationarity
assumption for the model.
Let us define the $K \times K$ diagonal matrices

$$\phi_i = \mathrm{diag}(a_{i,1}, \ldots, a_{i,K}), \quad i = 0, 1,$$
$$\sigma = \mathrm{diag}(\sigma_1, \ldots, \sigma_K),$$

for the possible values of the random variables $a_{i,Z_t}$ and $\sigma_{Z_t}$ in equation (12), and let
$\mathbf{1} = (1, \ldots, 1)'$ be a $K \times 1$ vector.

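Lemma 3.1. Let $\{Y_t\}_{t \ge 0}$ be an HM-MAR$(K, 1)$ process defined by (12), then
for $n \ge 2$

$$\Big(E\Big[\prod_{t=2}^{n} a_{1,Z_t} \,\Big|\, Z_1 = 1\Big], \ldots, E\Big[\prod_{t=2}^{n} a_{1,Z_t} \,\Big|\, Z_1 = K\Big]\Big)' = (P\phi_1)^{n-1} \mathbf{1}.$$

Proof. By the Markov property of $\{Z_t\}_{t \ge 1}$ we have that

$$E[a_{1,Z_t} \mid \{Z_s, s \le t-1\}] = E[a_{1,Z_t} \mid Z_{t-1}].$$

So

$$E\Big[\prod_{t=2}^{n} a_{1,Z_t} \,\Big|\, Z_1 = k\Big] = \sum_{z_2, \ldots, z_n} \Big(\prod_{t=2}^{n} a_{1,z_t}\Big) P(Z_2, \ldots, Z_n \mid Z_1 = k)$$

$$= \sum_{z_2, \ldots, z_n} \Big(\prod_{t=2}^{n} a_{1,z_t}\Big) P(Z_3, \ldots, Z_n \mid Z_2, Z_1 = k) \, P(Z_2 \mid Z_1 = k)$$

$$= \sum_{z_2} \Big\{\sum_{z_3, \ldots, z_n} \Big(\prod_{t=3}^{n} a_{1,z_t}\Big) P(Z_3, \ldots, Z_n \mid Z_2)\Big\} a_{1,z_2} P(Z_2 \mid Z_1 = k)$$

$$= E\Big[E\Big[\prod_{t=3}^{n} a_{1,Z_t} \,\Big|\, Z_2\Big] a_{1,Z_2} \,\Big|\, Z_1 = k\Big].$$

So for the vector of conditional expectations of $\prod_{t=2}^{n} a_{1,Z_t}$ given the different values
of $Z_1$, we have the following recursive equation

$$\begin{pmatrix} E[\prod_{t=2}^{n} a_{1,Z_t} \mid Z_1 = 1] \\ \vdots \\ E[\prod_{t=2}^{n} a_{1,Z_t} \mid Z_1 = K] \end{pmatrix} = \begin{pmatrix} \sum_{j=1}^{K} E[\prod_{t=3}^{n} a_{1,Z_t} \mid Z_2 = j] \, a_{1,j} \pi_{1j} \\ \vdots \\ \sum_{j=1}^{K} E[\prod_{t=3}^{n} a_{1,Z_t} \mid Z_2 = j] \, a_{1,j} \pi_{Kj} \end{pmatrix}$$

$$= \begin{pmatrix} \pi_{11} & \cdots & \pi_{1K} \\ \vdots & & \vdots \\ \pi_{K1} & \cdots & \pi_{KK} \end{pmatrix} \begin{pmatrix} a_{1,1} & & \\ & \ddots & \\ & & a_{1,K} \end{pmatrix} \begin{pmatrix} E[\prod_{t=3}^{n} a_{1,Z_t} \mid Z_2 = 1] \\ \vdots \\ E[\prod_{t=3}^{n} a_{1,Z_t} \mid Z_2 = K] \end{pmatrix} = P\phi_1 \begin{pmatrix} E[\prod_{t=3}^{n} a_{1,Z_t} \mid Z_2 = 1] \\ \vdots \\ E[\prod_{t=3}^{n} a_{1,Z_t} \mid Z_2 = K] \end{pmatrix}, \qquad (13)$$

in which the recursion starts at $t = n - 1$ as

$$\begin{pmatrix} E[a_{1,Z_n} \mid Z_{n-1} = 1] \\ \vdots \\ E[a_{1,Z_n} \mid Z_{n-1} = K] \end{pmatrix} = \begin{pmatrix} \pi_{11} a_{1,1} & \cdots & \pi_{1K} a_{1,K} \\ \vdots & & \vdots \\ \pi_{K1} a_{1,1} & \cdots & \pi_{KK} a_{1,K} \end{pmatrix} \begin{pmatrix} 1 \\ \vdots \\ 1 \end{pmatrix} = P\phi_1 \mathbf{1}.$$

Thus the solution of the recursive equation (13) is given by

$$\Big(E\Big[\prod_{t=2}^{n} a_{1,Z_t} \,\Big|\, Z_1 = 1\Big], \ldots, E\Big[\prod_{t=2}^{n} a_{1,Z_t} \,\Big|\, Z_1 = K\Big]\Big)' = (P\phi_1)^{n-1} \mathbf{1}.$$

□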



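Lemma 3.2. Let $\{Z_t\}_{t \ge 1}$ be a Markov chain starting with the invariant
probability measure $\mu$ defined by (5), then under the conditions of Lemma 3.1

$$E\Big[\prod_{t=2}^{n} a_{1,Z_t} \, a_{0,Z_1}\Big] = \mu' \phi_0 (P\phi_1)^{n-1} \mathbf{1}.$$

Proof. By Lemma 3.1 we have

$$E\Big[\prod_{t=2}^{n} a_{1,Z_t} \, a_{0,Z_1}\Big] = E\Big[a_{0,Z_1} E\Big[\prod_{t=2}^{n} a_{1,Z_t} \,\Big|\, Z_1\Big]\Big] = \sum_{k=1}^{K} \alpha_k \, a_{0,k} \, E\Big[\prod_{t=2}^{n} a_{1,Z_t} \,\Big|\, Z_1 = k\Big]$$

$$= (\alpha_1, \ldots, \alpha_K) \begin{pmatrix} a_{0,1} & & \\ & \ddots & \\ & & a_{0,K} \end{pmatrix} \begin{pmatrix} E[\prod_{t=2}^{n} a_{1,Z_t} \mid Z_1 = 1] \\ \vdots \\ E[\prod_{t=2}^{n} a_{1,Z_t} \mid Z_1 = K] \end{pmatrix} = \mu' \phi_0 (P\phi_1)^{n-1} \mathbf{1}. \qquad (14)$$

□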



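Lemma 3.3. If all eigenvalues of $P\phi_1$ lie inside the unit circle then under
the conditions of Lemma 3.2

i. $\lim_{m \to \infty} E[\prod_{n=2}^{m+1} a_{1,Z_n} \, a_{0,Z_1}] = 0$,

ii. $\lim_{t \to \infty} \sum_{m=0}^{t} E[\prod_{n=2}^{m+1} a_{1,Z_n} \, a_{0,Z_1}] = \mu' \phi_0 (I - P\phi_1)^{-1} \mathbf{1}$.

Also, if all eigenvalues of $P\phi_1^2$ lie inside the unit circle then

i. $\lim_{m \to \infty} E[(\prod_{n=2}^{m+1} a_{1,Z_n} \, a_{0,Z_1})^2] = 0$,

ii. $\lim_{t \to \infty} \sum_{m=0}^{t} E[(\prod_{n=2}^{m+1} a_{1,Z_n} \, a_{0,Z_1})^2] = \mu' \phi_0^2 (I - P\phi_1^2)^{-1} \mathbf{1}$.

Proof. The first part is an immediate result of Lemma 3.2 and Datta (page
508 of [8]), and for the second part:

$$\lim_{t \to \infty} \sum_{m=0}^{t} E\Big[\prod_{n=2}^{m+1} a_{1,Z_n} \, a_{0,Z_1}\Big] = \lim_{t \to \infty} \sum_{m=0}^{t} \mu' \phi_0 (P\phi_1)^{m} \mathbf{1} = \mu' \phi_0 (I - P\phi_1)^{-1} \mathbf{1}, \qquad (15)$$

in which the last equality holds by Datta (page 511 of [8]). The rest of the proof
can be done in a similar way by deriving a result similar to Lemma 3.2, namely

$$E\Big[\prod_{n=2}^{m+1} a_{1,Z_n}^2 \, a_{0,Z_1}^2\Big] = \mu' \phi_0^2 (P\phi_1^2)^{m} \mathbf{1}. \qquad \Box$$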
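
The identity of Lemma 3.2 can be checked numerically on a small example by comparing the matrix expression with a brute-force sum over all regime paths. The sketch below is only a sanity check under illustrative parameter values; the function names and the two-state chain are assumptions of this example.

```python
from itertools import product
from typing import Sequence


def closed_form(mu: Sequence[float], P: Sequence[Sequence[float]],
                a0: Sequence[float], a1: Sequence[float], n: int) -> float:
    """Evaluate mu' phi_0 (P phi_1)^(n-1) 1 without forming matrix powers."""
    K = len(mu)
    v = [1.0] * K
    for _ in range(n - 1):
        # v <- P phi_1 v, i.e. v_k = sum_j pi_{kj} a_{1,j} v_j.
        v = [sum(P[k][j] * a1[j] * v[j] for j in range(K)) for k in range(K)]
    return sum(mu[k] * a0[k] * v[k] for k in range(K))


def brute_force(mu: Sequence[float], P: Sequence[Sequence[float]],
                a0: Sequence[float], a1: Sequence[float], n: int) -> float:
    """Evaluate E[a_{0,Z_1} prod_{t=2}^n a_{1,Z_t}] by enumerating (Z_1..Z_n)."""
    K = len(mu)
    total = 0.0
    for path in product(range(K), repeat=n):
        prob = mu[path[0]]
        for s, t in zip(path, path[1:]):
            prob *= P[s][t]            # probability of the regime path
        coef = a0[path[0]]
        for z in path[1:]:
            coef *= a1[z]              # product of AR coefficients along the path
        total += prob * coef
    return total


# Illustrative two-state chain; mu is its invariant distribution.
P_ex = [[0.9, 0.1], [0.8, 0.2]]
mu_ex = [8.0 / 9.0, 1.0 / 9.0]
a0_ex, a1_ex = [1.0, 2.0], [0.3, 1.2]
gaps = [abs(closed_form(mu_ex, P_ex, a0_ex, a1_ex, n)
            - brute_force(mu_ex, P_ex, a0_ex, a1_ex, n)) for n in range(2, 8)]
```

The enumeration grows as $K^n$, so it is useful only for verification with small $n$; the matrix form costs on the order of $n K^2$ operations.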



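Lemma 3.4. If $E[Y_0^2] < \infty$ then under the conditions of Lemma 3.3

$$\lim_{t \to \infty} E\Big[\prod_{i=1}^{t} a_{1,Z_i} Y_0\Big] = 0.$$

Proof. By the Cauchy-Schwarz inequality we have

$$\Big[Cov\Big(\prod_{i=1}^{t} a_{1,Z_i}, Y_0\Big)\Big]^2 \le Var\Big(\prod_{i=1}^{t} a_{1,Z_i}\Big) Var(Y_0);$$

by Lemma 3.3 we can deduce that

$$\lim_{t \to \infty} Var\Big(\prod_{i=1}^{t} a_{1,Z_i}\Big) = 0,$$

and since $Var(Y_0) < \infty$,

$$\lim_{t \to \infty} Cov\Big(\prod_{i=1}^{t} a_{1,Z_i}, Y_0\Big) = 0,$$

thus

$$\lim_{t \to \infty} E\Big[\prod_{i=1}^{t} a_{1,Z_i} Y_0\Big] = \lim_{t \to \infty} E\Big[\prod_{i=1}^{t} a_{1,Z_i}\Big] E[Y_0] = 0,$$

in which the last equality can be verified by Lemma 3.3 and the fact that
$E[Y_0]$ is finite by the assumption that $E[Y_0^2] < \infty$ (page 274 of [5]). □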

Theorem 3.1. Let $\{Y_t\}_{t \ge 0}$ follow the HM-MAR$(K, 1)$ model defined by
(12), and let the following assumptions hold:

i. $\{Z_t\}_{t \ge 1}$ is an ergodic Markov chain starting from its invariant
probability measure $\mu$ specified in equation (5),

ii. $E[Y_0^2] < \infty$,

iii. all eigenvalues of $P\phi_1$ and $P\phi_1^2$ lie inside the unit circle.

Then the process is asymptotically stable in mean and

$$\lim_{t \to \infty} E[Y_t] = \mu' \phi_0 (I - P\phi_1)^{-1} \mathbf{1}. \qquad (16)$$

Proof. Iterating equation (12), we get

$$Y_t = a_{0,Z_t} + a_{1,Z_t} Y_{t-1} + \sigma_{Z_t} \varepsilon_t = \sum_{m=0}^{t-1} \prod_{i=0}^{m-1} a_{1,Z_{t-i}} \left(a_{0,Z_{t-m}} + \sigma_{Z_{t-m}} \varepsilon_{t-m}\right) + \prod_{i=0}^{t-1} a_{1,Z_{t-i}} Y_0. \qquad (17)$$

Let $u = t - i$ in (17) to get

$$Y_t = \sum_{m=0}^{t-1} \prod_{u=t-m+1}^{t} a_{1,Z_u} \left(a_{0,Z_{t-m}} + \sigma_{Z_{t-m}} \varepsilon_{t-m}\right) + \prod_{u=1}^{t} a_{1,Z_u} Y_0$$

$$= \sum_{m=0}^{t-1} \prod_{u=2}^{m+1} a_{1,Z_u} \left(a_{0,Z_1} + \sigma_{Z_1} \varepsilon_{t-m}\right) + \prod_{u=1}^{t} a_{1,Z_u} Y_0, \qquad (18)$$

where the last equality follows from the strict stationarity property of $\{Z_t\}_{t \ge 1}$
(page 35 of [10]), which is implied by assumption (i) of the theorem. Also, by the
independence assumption of $\{\varepsilon_t\}$ from $\{Z_t\}_{t \ge 1}$ in (12):

$$\lim_{t \to \infty} E\Big[\sum_{m=0}^{t-1} \prod_{u=2}^{m+1} a_{1,Z_u} \sigma_{Z_1} \varepsilon_{t-m}\Big] = \lim_{t \to \infty} \sum_{m=0}^{t-1} E\Big[\prod_{u=2}^{m+1} a_{1,Z_u} \sigma_{Z_1}\Big] E[\varepsilon_{t-m}] = 0. \qquad (19)$$

Thus by Lemma 3.4 and (18)-(19) we have that

$$\lim_{t \to \infty} E[Y_t] = \lim_{t \to \infty} E\Big[\sum_{m=0}^{t-1} \prod_{u=2}^{m+1} a_{1,Z_u} \, a_{0,Z_1}\Big],$$

so by assumption (iii) and Lemma 3.3 we get (16). □

One interesting feature of Theorem 3.1 is that the HM-MAR model could
consist of some explosive (with $a_1 > 1$) and non-explosive autoregressive
processes and still remain asymptotically stable in mean.
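
The limit in (16) can be evaluated numerically through the series $\sum_{m \ge 0} (P\phi_1)^m \mathbf{1}$ from (15). The sketch below is a minimal illustration under stated assumptions: the function name, tolerance and parameter values are hypothetical, and the code only detects non-convergence of this first-moment series; it does not check the condition on $P\phi_1^2$ in assumption (iii).

```python
from typing import List, Sequence


def limiting_mean(
    mu: Sequence[float],             # invariant distribution of the hidden chain
    P: Sequence[Sequence[float]],    # transition matrix, P[k][j] = pi_{kj}
    a0: Sequence[float],             # intercepts a_{0,h}
    a1: Sequence[float],             # AR(1) coefficients a_{1,h}
    tol: float = 1e-14,
    max_terms: int = 10_000,
) -> float:
    """Evaluate mu' phi_0 (I - P phi_1)^{-1} 1 via its Neumann series."""
    K = len(mu)
    term: List[float] = [1.0] * K    # current term (P phi_1)^m 1
    acc: List[float] = [0.0] * K     # partial sum of the series
    for _ in range(max_terms):
        acc = [s + x for s, x in zip(acc, term)]
        term = [sum(P[k][j] * a1[j] * term[j] for j in range(K)) for k in range(K)]
        if max(abs(x) for x in term) < tol:
            return sum(mu[k] * a0[k] * acc[k] for k in range(K))
    raise ValueError("series did not converge; check the eigenvalues of P*phi_1")


# One stable regime (a1 = 0.3) mixed with one explosive regime (a1 = 1.2).
P_mix = [[0.9, 0.1], [0.8, 0.2]]
mu_mix = [8.0 / 9.0, 1.0 / 9.0]
mean_mix = limiting_mean(mu_mix, P_mix, a0=[1.0, 2.0], a1=[0.3, 1.2])
```

For these illustrative values the eigenvalues of $P\phi_1$ are roughly 0.43 and 0.08, so the series converges even though the second regime alone would be explosive.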
Definition 3.1. Let $\lambda$ be the spectral radius of

$$A = \mathrm{diag}(P\phi_1^2 \mathbf{1}) = \mathrm{diag}\big(E[a_{1,Z_t}^2 \mid Z_{t-1} = 1], \ldots, E[a_{1,Z_t}^2 \mid Z_{t-1} = K]\big).$$

Lemma 3.5. Let the spectral radius $\lambda$ be as in Definition 3.1. If $\lambda$ lies inside
the unit circle then under the conditions of Lemma 3.2



t-l m+l ^ / j,2-i 

,1 + At0ol- 



^[(E n «i.-.«o,zJ^] < f < oo. (20) 

m=0 i=2 

Furthermore if E[Yq~^'^] < oo, e > then 



1=1 

Proof. By the definition of the spectral radius, the absolute values of
all eigenvalues of $A$ are less than or equal to $\lambda$, so by the lemma's assumption
about $\lambda$ we have $E[a_{1,Z_t}^2 \mid Z_{t-1} = k] \le \lambda < 1$ for all values of $k =
1, \ldots, K$; thus, by the method of iterative conditioning,

m+l m+l 
u=2 u=2 



= E[E[alz^^^Ja{Zr}]llalzA,^] 

u=2 

m 

< mll<zA,z,], (21) 

u=2 

in which a{ZY^} = cr{^i, • • • , Zm}- Iterating ([2T]) we get 

m+l 

EiU alzAz.] < A™i?KzJ = A"^/.>gl, (22) 

u=2 

thus 

t— 1 m+l t—1 I jJ2-i 

hm Y.E[\{ alzA^z^ < M>gl( hm ^ A^) = (23) 

r)i=0 u=2 m=0 

Now by Cauchy Schwarz inequality, 

m+l n+1 

-^^[(11 «l.^.«0,Zi) (JJ «l,^,«0,Zi)] 
i=2 j=0 

m+l n+1 

< EiU alz^alzMU ^l,Zj'^0,Zi]i 

i=2 j=2 
thus

m+1 n+1 
i=2 j=2 

and summing over the different values of $m \ne n$ from $0$ to $\infty$,

00 m+l n+1 00 

i?[(n«i,^.«o,zj(n«i,^.«o,zj]< E /^>oiA(™+")/' 

00 / , 2-, 

<(J:/x>2iaM/2)2 = (_^)2. (24) 

m=0 

Now by (USD and dM]) we have 

t-l m+l 
m=0 i=2 

00 m+l 00 m+l n+1 

E[{ J] ai,z.ao,zj'] + 2 ^ J] «i,z,ao,zJ(n «i,^.«o,zJ] 

m=0 1=2 m^n=0 i=2 j=2 

^ 1-A +^4-Ai/2^ i_Ai/2 > ■ 

Now by Holder inequality (page 80 of [5j), 

in which $u, v > 1$ and $1/u + 1/v = 1$, so for $v = 1 + \epsilon/2$ we set $u = v/(v - 1)$
and $E[|Y_0|^{2v}] = E[|Y_0|^{2+\epsilon}] < \infty$. Thus by inequality (22) and the fact that $\lambda < 1$, we

have

t 

hm EiHalzYi] = lim \'-' E[alr,J,'] = 0. 

i=l 

□ 

Thus by Lemmas 3.3 and 3.5, we get the following inequality

t-l m+l , ,2-1 

J_im yar(E n"i.^-«o.^i)<^( 1^/2 )M/^0o(/ - ^^i)'^!)^- (25) 

m=0 i=2 
Theorem 3.2. Let $\{Y_t\}_{t \ge 0}$ follow the HM-MAR$(K, 1)$ model defined by
(12), with $\lambda$ as in Definition 3.1. If the conditions of Theorem 3.1 hold and

i. $E[Y_0^{2+\epsilon}] < \infty$, $\epsilon > 0$,

ii. $\lambda < 1$,

then the process has finite second moment and

liraEiY,') < 2{'-^^f + - P<t>l)-'l. (26) 

Proof. Using (18) we have

t-1 m+l t-1 m+l 

m=0 i=2 m=0 i=2 

t t-1 m+l t 

E[llalzYo'] + 2E[{Y, U ai,z,ao,zJ J] «i.^»^o] + 

i=l m=0 i=2 i=l 

t-1 m+l t t-1 m+l 

n «i,^.«o,zi +n«i.^.^o)(j; n ^i^'^z^et-m)], (27) 

m=0 i=2 i=l m=0 «=2 

by the independence of the Gaussian IID(0,1) process $\{\varepsilon_t\}$ from $\{Z_t\}$ (as indicated
in (12)), we have

t-1 m+l t-1 m+l 

EiiYl n «i,^.«o,Zi)(^ n ai,z,^z^et-m)] = 0. (28) 

m=0 i=2 m=0 i=2 

Also, by the Cauchy-Schwarz inequality we have that

t-1 m+l t 

[coviiY, n«i.^.«o,zj,n«i.^.^o)]' 

m=0 i=2 i=l 

t-1 m+l t 

< yar(^ Y{ ai,z,ao^Zi)Var{Y[ai^z,Yo), 

m=0 j=2 «=1 

Lemmas 3.4 and 3.5 imply that $\lim_{t \to \infty} Var(\prod_{i=1}^{t} a_{1,Z_i} Y_0) = 0$, so by (25)
we have

t-1 m+l t 

l}m[Cov{{Y, n «i,^.«o,zJ,n«i.^»^o)]' = 0, 

m=0 1=2 i=l 




so we get 

t-1 m+1 

t^oo 



=0 i=2 i=l 

t-1 m+1 t 

lim E[{Y^ W a^,z,ao,z,]]E[\{a^,zM = 0, (29) 



m=0 i=2 i=l 



in which the last equality follows by Lemma 3.3 and Lemma 3.4. By a similar
method as for (29) we get



t-1 m+1 

E\{\ 

t—^oo 



lim n ^i,z^<^o,z^et-m})([lai,z,Yo)] 

m=0 i=2 1=1 

t-1 m+1 t 

= lim S[{V TT ai^z,aQ^z,et-m}]E\V\ai^zM=^- (30) 

t— s-oo ^ — ' 

m=0 i=2 j=l 

Thus collecting results, by lemma [331 (!27ti30p we have 

t-1 m+1 t-1 m+1 

m=0 1=2 m=0 i=2 

Now by lemmaEa lim^^oo ^EmioinS' = fia^I-Pcj^D-^, 

so using lemma [331 we get (p6|) . □ 

Remark 3.2. An immediate consequence of Theorems 3.1 and 3.2 is that
hm Var{Yt) < 2( \+^^f °^ ^ + /.^^(j _ ^^2^-1-^ _ _ P0^)-ii)2. 

t^oo 1 — A ' 

This result can be considered as an appropriate upper bound for the variance,
as we utilize inequality (20) for the first term of (27) via the Cauchy-Schwarz
inequality.

Theorem 3.3. Let $\{Y_t\}_{t \ge 0}$ follow the HM-MAR$(K, 1)$ model defined by
(12) and let $\phi_i^{+} = \mathrm{diag}(|a_{i,1}|, \ldots, |a_{i,K}|)$ for $i = 0, 1$. Also, let the conditions of
Theorem 3.2 hold and all eigenvalues of $P\phi_1^{+}$ lie inside the unit circle; then
$E[\lim_{t \to \infty} Y_t^2]$ exists and is finite.



Proof. Let random variable X be defined as 

t m+1 

- \ 



X= lim = lim{V' I ]~r ai,zJ(|ao,ZiI + [(T^iet-jnl) + |TTai,z,'5^oI}- 



m=0 1=2 i=0 
By the monotone convergence theorem (Theorem 16.2 of [5]), $E[X^2] = \lim_{t \to \infty} E[X_t^2]$.
By the assumption of theorem l3.21 we deduce that spectral radius of 
hes inside the unit circle, so by a similar method as used to obtain ()20p in 
lemma [331 we have 

t—l m+l ^ '2-1 

a- ^[(E I n -iwt-^\f] < 2(77^^)'- (31) 

m=0 1=2 ^ ' 

So by (jSip .lemma 13.51 and Cauchy Schwarz inequality we have that 

t-l m+l t-1 m+l 

m=0 1=2 m=0 i=2 

^ (l + /.V^l)(l + /.>gl) 

By the assumption, all eigenvalues of Pcf>i lie inside the unit circle, so by a 
similar method as used to obtain (|15p we have that 

t-l m+l 

lim I n ai,z,ao,Ziet-m}I] = cl)+ {I - Pcl)+)-^l < 00. 

m=0 i=2 

Therefore, using inequality (32) instead of (28) in the proof of Theorem 3.2,
relation (26) changes and we get

lim E[Xn < 2{'-±^f + t^a^I - P^?)-^! + 
2(l + A*''^'l)(l + /^>ol) 



(1 - Al/2)2 

Thus $X^2$ is integrable, so $X$ is integrable. Also, by the triangle inequality we
have $|Y_t| \le X$ for all $t$, and thus for all $\omega \in \Omega$, $\lim_{t \to \infty} Y_t = Y$, where

00 m+l 00 

^ = {E n "i-^>(«o,Zi +o-Zieo) +^"l.^«^OJ■• 
m=o i=2 i=0 

So by the continuous mapping theorem [1] we have that $Y_t^2 \to Y^2$ almost surely.
Finally, $|Y_t| \le X$ implies that $|Y_t^2| \le 1 + X^2$, so by the integrability of $X^2$
and the dominated convergence theorem (Theorem 16.4 of [5]) we conclude that
$E[\lim_{t \to \infty} Y_t^2]$ exists and



E[Y^] = E[lim y/] = lim E[Yi^] < 00. 

t—^oo t^oo 



□ 
4 Simulation

The hidden process $\{Z_t\}$ is assumed to follow a first-order Markov
structure, so HM-MAR can be considered as a generalization of the MAR model.
To clarify, the MAR model can be considered as an HM-MAR model with an
independent hidden process $\{Z_t\}$. However, the HM-MAR model is more complex:
it uses the past observations to determine the next coefficients, and it demands a
longer calculation to estimate the parameters and to dynamically update the
weighting coefficients.

In this section, we investigate the efficiency of these models for time
series where $\{Z_t\}$ follows a first-order Markov process. To this end, 100
observations are generated from the following HM-MAR model:

F{yt\Tt-,) = + 

(l-aW)^(^^""-^^^-^^"°-'^^-^) 

with $\rho = (1, 0)'$ (that is, starting from the first component, whose conditional mean is
$0.7 y_{t-1} + 0.2 y_{t-2}$) and transition probability matrix $P = [0.8077, 0.1923; 0.7619, 0.2381]$. We
used the EM algorithm [16] to estimate the conditional probability of the hidden
variable $Z_t$ given $Y_1, \ldots, Y_T$ (i.e. $P(Z_t \mid Y_1, \ldots, Y_T)$), and the Baum-Welch [15]
algorithm to estimate the joint conditional probability of $Z_t, Z_{t-1}$ given
$Y_1, \ldots, Y_T$ (i.e. $P(Z_t, Z_{t-1} \mid Y_1, \ldots, Y_T)$). Using these estimates we get the
following HM-MAR model

/ IT- A (t)^, yt-0.651%_i- 0.2973yf-2 , , 

FHM-MAR{yt\Tt-\) = a\'^[ ^^-^g^;^ ) + 
Table 1: Sum of absolute forecasting errors by MAR and HM-MAR models
in 10 iterations

Iterations   1         2         3         4         5
HM-MAR       27.7559   27.7590   27.7560   27.7559   27.7576
MAR          75.1827   75.0973   75.2079   75.0839   75.2378

Iterations   6         7         8         9         10
HM-MAR       27.7580   27.7567   27.75657  27.7566   27.7569
MAR          75.1054   75.1464   75.1338   75.09641  75.2176



, _ {t).^, yt - 0.64687/f,i - 0.3050yf_2 . 
^ "1 0.9875 

with p = (0.7261,0.2739)' and P = [0.5905, 0.4095; 0.3331,0.6669], and 

FMAR{yt\^t-i) = 0.3732$( ^'^ri% > + 

0.6278$(^i^MiZ^^i-i^^:^i^), 
^ 0.8640 ^ 

is the estimated MAR model. In Figure 1, the left panel shows the sample
path of the forecasting errors of the MAR model (blue) and of the
HM-MAR model (green). The right panel presents the sample path of the
simulated HM-MAR model (red), the observations forecast by the MAR model (blue) and
the observations forecast by the HM-MAR model (green). We observe that the
HM-MAR model produces considerably smaller forecasting errors than the MAR
model and a better approximation of the time series. Table 1 presents the sum of the
absolute forecast errors of the MAR and HM-MAR models for ten iterations
(about 27.76 for HM-MAR against 75.08 to 75.24 for MAR).



5 Summary and discussions 

We proposed the HM-MAR model as a flexible structure for modeling the
conditional distribution of $Y_t$ given past observations $(Y_1, \ldots, Y_{t-1})$ in a nonlinear
time series. We considered the HM-MAR model as a mixture of Gaussian
distributions where the mean of each distribution follows an AR$(p)$
model. Unlike in ordinary mixture models, the weighting coefficients
determining the contribution of the distributions are not of a predefined fixed form
(constant values). These values are conditional probabilities of a latent
variable $Z_t$ given past observations $(Y_1, \ldots, Y_{t-1})$. At each time step the
coefficients are determined through maximizing the posterior probability of the
latent variable $Z_{t-1}$ given past information. Latent variables are assumed
to follow a Markov process in order to build a parsimonious model. A suitable
application for the HM-MAR model is when the process $Y_t$ is the result of several
processes, and the contribution of each process changes over time. If such
effects are not present in the time series, then our model automatically
reduces to the ordinary mixture models. The HM-MAR model also reduces to a
hidden Markov model for a continuous process $\{Y_t\}$ when $p$ is zero (i.e. $Y_t$,
given $Z_t$, is independent of past observations).

Although modeling the effect of all past information makes the model
complicated, a dynamic programming method is proposed for forecasting.
It is worth mentioning that it is still possible to study some properties of
$\{Y_t\}$, such as the asymptotic behavior of the first moment, the existence and finiteness
of the second moment, and an upper bound for the asymptotic variance of the
process. Although the variance of each distribution in the mixture model
is constant, the conditional variance of the process in the HM-MAR model is
not fixed. This feature can be used to model the conditional volatility effects
frequently present in financial time series. Another interesting feature is
that the first-order HM-MAR$(K, 1)$ model can be considered as a mixture of
some explosive autoregressive processes (i.e. $a_1 > 1$) and non-explosive
ones (i.e. $a_1 < 1$), and it is nevertheless asymptotically stable in first and
second order.

This work has the potential to be applied in the context of nonlinear
time series by imposing a hidden Markov property on the weighting
coefficients of a mixture model. It can also motivate further research on
extending the stability results to the case of HM-MAR$(K, p)$, where the lag
of the autoregressive processes is of order $p$. Stationarity and ergodicity are
two major aspects still to be studied. Finally, this area of research can be expanded by
considering distributions other than the Gaussian as the underlying
distribution of the mixture model.